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We present a new characterization of termination of general logic programs. Most existing ter- 
mination analysis approaches rely on some static information about the structure of the source 
code of a logic program, such as modes/types, norms/level mappings, models/interargument rela- 
tions, and the like. We propose a dynamic approach which employs some key dynamic features of 
an infinite (generalized) SLDNF-derivation, such as repetition of selected subgoals and recursive 
increase in term size. We also introduce a new formulation of SLDNF-trees, called generalized 
SLDNF-trees. Generalized SLDNF-trees deal with negative subgoals in the same way as Prolog 
and exist for any general logic programs. 
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1. INTRODUCTION 

For a program in any computer language, in addition to having to be logically 
correct, it should be terminating. Due to frequent use of recursion in logic pro- 
gramming, however, a logic program may more likely be non-terminating than a 
procedural program. Termination of logic programs then becomes an important 
topic in logic programming research. Because the problem is extremely hard (un- 
decidable in general), it has been considered as a never-ending story; see [Schreye 
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and Decorte 1993] for a comprehensive survey. 

The goal of termination analysis is to establish a characterization of termina- 
tion of a logic program and design algorithms for automatic verification. A lot of 
methods for termination analysis have been proposed in the last decade. A ma- 
jority of these existing methods are the norm- or level mapping-based approaches, 
which consist of inferring mode/type information, inferring norms/level mappings, 
inferring models/interargument relations, and verifying some well-founded condi- 
tions (constraints). For example, Ullman and Van Gelder [Ullman and Gelder 
1988] and Plumer [Plumer 1990b; 1990a] focused on establishing a decrease in term 
size of some recursive calls based on interargument relations; Apt, Bczem and Pe- 
dreschi [Apt and Pedreschi 1993; Bezem 1992], and Bossi, Cocco and Fabris [Bossi 
et al. 1994] provided characterizations of Prolog left-termination based on level 
mappings/norms and models; Verschaetse [Verschaetse 1992], Decorte, De Schr- 
eye and Fabris [Decorte et al. 1993], and Martin, King and Soper [Martin et al. 
1997] exploited inferring norms/level mappings from mode and type information; 
De Schreye and Verschaetse [Schreye and Verschaetse 1995], Brodsky and Sagiv 
[Brodsky and Sagiv 1991], and Lindenstrauss and Sagiv [Lindenstrauss and Sagiv 
1997] discussed automatic inference of interargument/size relations; De Schreye, 
Verschaetse and Bruynooghe [Schreye et al. 1992] addressed automatic verification 
of the well-founded constraints. Very recently, Decorte, De Schreye and Vandecas- 
teele [Decorte et al. 1999] presented an elegant unified termination analysis that 
integrates all the above components to produce a set of constraints that, when 
solvable, yields a termination proof. 

It is easy to see that the above methods are compile-time (or static) approaches 
in the sense that they make termination analysis only relying on some static in- 
formation about the structure (of the source code) of a logic program, such as 
modes/types, norms (i.e. term sizes of atoms of clauses) /level mappings, mod- 
els/interargument relations, and the like. Our observation shows that some dynamic 
information about the structure of a concrete infinite SLDNF-derivation, such as 
repetition of selected subgoals and recursive increase in term size, plays a crucial 
role in characterizing the termination. Such dynamic features are hard to capture 
by applying a compile-time approach. This suggests that methods of extracting 
and utilizing dynamic features for termination analysis are worth exploiting. 

In this note, we present a dynamic approach by employing dynamic features of 
an infinite (generalized) SLDNF-derivation to characterize termination of general 
logic programs. In Section 2, we introduce a notion of a generalized SLDNF-tree, 
which is the basis of our method. Roughly speaking, a generalized SLDNF-tree is 
a set of standard SLDNF-trees augmented with an ancestor-descendant relation on 
their subgoals. In Section 3, we define a key concept, loop goals, which captures 
both repetition of selected subgoals and recursive increase in term size of these sub- 
goals. We then prove a necessary and sufficient condition for an infinite generalized 
SLDNF-derivation in terms of loop goals. This condition allows us to establish a 
dynamic characterization of termination of general logic programs (Section 4). In 
Section 5, we mention the related work, and in Section 6 we conclude the paper 
with our future work. 
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1.1 Preliminaries 

We present our notation and review some standard terminology of logic programs 
as described in [Lloyd 1987]. 

Variables begin with a capital letter, and predicate, function and constant sym- 
bols with a lower case letter. A term is a constant, a variable, or a function of the 
form f(T\, T m ) where / is a function symbol and the TjS are terms. An atom is 
of the form p(Ti, ...,T m ) where p is a predicate symbol and the T^s are terms. A 
literal is of the form A or ->A where A is an atom. Let A be an atom/term. The 
size of A, denoted \A\, is the number of occurrences of function symbols, variables 
and constants in A. By we denote a sequence A\, A 2 , A n . Two atoms, 

A and B, are said to be variants if they are the same up to variable renaming. 

Lists are commonly used terms. A list is of the form [] or [T\L] where T is a 
term and L is a list. For our purpose, the symbols [, ] and | in a list are treated as 
function symbols. 

Definition 1.1. A (general) logic program is a finite set of clauses of the form 
A <— Li, L n 

where A is an atom and LiS are literals. When n = 0, the "<— " symbol is omitted. 
A is called the head and L\, ...,L n is called the body of the clause. If a general 
logic program has no clause with negative literals like ~^A in its body, it is called a 
positive logic program. 

Definition 1.2. A goal is a headless clause <— L\, ...,L n where each literal Li is 
called a subgoal. L\,...,L n is called a (concrete) query. 

The initial goal, Go =<— L\, L n , is called a top goal. Without loss of generality, 
we shall assume throughout the paper that a top goal consists only of one atom, 

1. e. n = 1 and L\ is a positive literal. 

Definition 1.3. A control strategy consists of two rules, one for selecting a goal 
from among a set of goals and the other for selecting a subgoal from the selected 
goal. 

The second rule in a control strategy is usually called a selection or computation 
rule in the literature. To facilitate our presentation, throughout the paper we choose 
to use the best-known depth-first, left-most control strategy (used in Prolog) to 
describe our approach (It can be adapted to any other fixed control strategies). So 
the selected subgoal in each goal is the left-most subgoal. Moreover, the clauses in 
a logic program are used in their textual order. 

Trees are commonly used to represent the search space of a top-down proof 
procedure. For convenience, a node in such a tree is represented by Ni : d where 
Ni is the name of the node and d is a goal labeling the node. Assume no two 
nodes have the same name. Therefore, we can refer to nodes by their names. 

2. GENERALIZED SLDNF-TREES 

In order to characterize infinite derivations more precisely, in this section we extend 
the standard SLDNF-trees [Lloyd 1987] to include some new features. We first 
define the ancestor-descendant relation on selected subgoals. Informally, A is an 
ancestor subgoal of B if the proof of A needs (or in other words goes via) the proof 
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of B. For example, let M :<— A, A\, A m be a node in an SLDNF-tree, and 
N :<— Si, -B„, Ai, A TO be a child node of M that is generated by resolving 
M on the subgoal A with a clause A <— B\, ...,B n . Then A at M is an ancestor 
subgoal of all BiS at A. However, such relationship does not exist between A at M 
and any Aj at N. It is easily seen that all BiS at N inherit the ancestor subgoals 
of A at M. 

The ancestor-descendant relation can be explicitly expressed using an ancestor 
list. The ancestor list of a subgoal A at a node M, denoted ALa@M) is of the 
form {(#!,£>!), (N h A)} (/ > 0), where for each (N,D t ) G AL A@M , TV, is a 
node name and Di a subgoal such that Di at N t is an ancestor subgoal of A at 
M. For instance, in the above example, the ancestor list of each Bi at node N 
is ALb { @n = {(M, A)} U ALa@m and the ancestor list of each Ai at node N is 
ALa { @n = ALa^m- 

Let Ni : d and Nk : G& be two nodes and A and i? be the selected subgoals in 
d and G&, respectively. We use A < a nc B to denote that A is an ancestor subgoal 
of B. When A is an ancestor subgoal of B, we refer to B as a descendant subgoal 
of A, Ni as an ancestor node of Afc, and Nk as a descendant node of TVj. 

Augmenting SLDNF-trees with ancestor lists leads to the following definition of 
SLDNF*-trees. 

Definition 2.1 (SLDNF* -trees). Let P be a general logic program, G r =<— A r 
a goal with A r an atom, and i? the depth-first, left-most control strategy. The 
SLDNF* -tree Tm t :G t for P U {G r } via i? is defined inductively as follows. 

(1) N r : G r is its root node, and the tree is completed once a node marked as 
LAST is generated or when all its leaf nodes have been marked as U tl Uf or 

a fl- 

(2) For each node Ni :<— L\, L m in the tree that is selected by R, if m = then 
(1) Ni is a success leaf marked as □* and (2) if ALA r @N r ^ then N is also 
a node marked as LAST. Otherwise (i.e. m > 0), we distinguish between the 
following three cases. 

(a) L\ is a positive literal. For each clause B <— Bi, ...,B n in P such that L\ 
and B are unifiable, JV^ has a child node 

N s (Bi, B n , L 2 , L m )9 
where is an mgu (i.e. most general unifier) of L\ and B, the ancestor list 
for each B k 6, k e {1, . . . ,n}, is AL Bk9@Ns = {(JV i5 ii)} U AL il@ jv 4 , and 
the ancestor list for each L^Q, k 6 {2, . . . , m}, is ALl^®^ = ^^l^©^- 
If there exists no clause in P whose head can unify with L\ then Ni has a 
single child node — a failure leaf marked as □/ . 

(b) L\ = ->A is a ground negative literal. Let Tjv i+1 :<-A be an (subsidiary) 
SLDNF*-tree for P U A} via R with ALA@JV i+1 = AL Ll@Ni . We have 
the following four cases: 

i- Tiv 4+i:< -A has a success leaf. Then iVj has a single child node — a failure 
leaf marked as □ / . 

ii. Tjv 4+i:< -a has no success leaf but has a flounder leaf. Then N has a 
single child node — a flounder leaf marked as . 

iii. All branches of Tjv 4+i:< -a end with a failure leaf. Then A^j has a single 
child node 
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N a :<— Lz, L rn 
with AL Lk @ Ns = AL Lk@Nz for each k G {2, . . . , m}. 
iv. Otherwise, Ni has no child node. It is the last node of TM r :G r so that 
it is marked as LAST, 
(c) L\ = -> A is a non-ground negative literal. Then N has a single child node 
— a flounder leaf marked as . 

Starting from the root node N r : G r , we expand the nodes of the SLDNF*-trcc 
TN r :G r following the depth-first order. The expansion for TN r -.G r stops when either 
a node marked as LAST is generated or all of its leaf nodes have been marked as 
□t, □/ or □ /;. 

In this paper we do not consider floundering — a situation where a non-ground 
negative subgoal is selected by R (see the case 2c). See [Chan 1988] for discussion 
on such topic. 

We first prove the following. 

Theorem 2.2. Let T^ i+1[ ^A be a subsidiary SLDNF* -tree built for proving a 
negative subgoal L\ = ->A at a node Ni (see the case 2b). Then AL ^ 0. 

Proof. Note that AL,A®N i+1 — AL Ll @ Ni . Since the subgoal L x at Ni is nega- 
tive, Ni cannot be the root node of the SLDNF*-tree that contains iVj. Therefore, 
Li at Ni has at least one ancestor subgoal (i.e. the subgoal at the root node of the 
tree), which means ALA@N i+1 ^= 0- □ 

In order to solve a top goal Go =*— A , we build an SLDNF*-tree Tn -.^a 
for P U {Go} via R with ALa @n = 0- It is easy to see that Tjv o:< -a is an 
enhancement of the standard SLDNF-tree for P U {Go} via R with the following 
three new features. 

(1) Each node Ni is associated with an ancestor list ALr l .@N i for each Lj of its 
subgoals. In particular, subgoals of a subsidiary SLDNF*-tree T/v i+i:< _ / 4 built 
for solving a negative subgoal L\ = ^A at Ni inherit the ancestor list AL^^Ni 
(see the case 2b). This bridges the ancestor-descendant relationships across 
SLDNF*-trees and is especially useful in identifying infinite derivations across 
SLDNF*-trees (see Example 2.1). Note that a negative subgoal will never be 
an ancestor subgoal. 

(2) In a standard SLDNF-tree, to handle a ground negative subgoal L\ = ->A at Ni 
a full subsidiary SLDNF-tree FT for PU{^- A} via R must be generated. In an 
SLDNF*-tree, however, the subsidiary SLDNF*-tree Tjv i+i: <_4 may not include 
all branches of FT because it will terminate at the first success leaf (see the 
case 2 where by Theorem 2.2 AL,A@N i+1 ^ 0)- The intuition behind this is that 
it is absolutely unnecessary to exhaust the remaining branches of FT because 
they would never generate any new answers for A (and ~^A). Such a pruning 
mechanism embedded in SLDNF*-trees is very useful in not only improving the 
efficiency of query evaluation but also avoiding some possible infinite derivations 
(see Example 2.2). In fact, Prolog performs the same pruning by using a cut 
operator to skip the remaining branches of FT once the first success leaf is 
generated (e.g. see SICStus Prolog [ISLAB 1998]). 
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(3) A well-known problem with the standard SLDNF-tree approach (formally called 
SLDNF-resolution [Clark 1978; Lloyd 1987]) is that for some programs, such 
as P = {A <- -^A} and G =<- A, no SLDNF-trees exist [Apt and Doets 1994; 
Kunen 1989; Martelli and Tricomi 1992]. The main reason for this abnormality 
lies in the fact that to solve a negative subgoal -<A it generates a subsidiary 
SLDNF-tree FT for P U {<— A} via R which is supposed either to contain a 
success leaf or to consist of failure leaves. When FT neither contains a success 
leaf nor finitely fails by going into an infinite derivation, the negative subgoal 
cannot be handled. 

In contrast, SLDNF*-trees exist for any general logic programs. A ground 
negative subgoal ->A at a node Ni succeeds if all branches of the subsidiary 
SLDNF*-tree T^ i+i: ^A end with a failure leaf (see the case 2(b)iii), and fails 
if Tjv 4+i:< _a has a success leaf (see the case 2(b)i) . Otherwise, the value of the 
subgoal -iA is undetermined and thus Ni is marked as LAST, showing that it 
is the last node of the underlying SLDNF*-tree which can be finitely generated 
(see the case 2(b)iv). 1 The tree is then completed here. 

For convenience, we use dotted edges "• • •>" to connect parent and child (sub- 
sidiary) SLDNF*-trees, so that infinite derivations across SLDNF*-trees can be 
clearly identified. Formally, we have 

Definition 2.3. Let P be a general logic program, Go a top goal and R the depth- 
first, left-most control strategy. Let Tm -.g be the SLDNF*-tree for P U {Go} via 
R with ALa @n = 0. A generalized SLDNF-tree for P U {Go} via R, denoted 
GTg , is rooted at No : Go and consists of Tn -.g along with all its descendant 
SLDNF*-trees, where parent and child SLDNF*-trees are connected via "• • •>". 

Therefore, a path of a generalized SLDNF-tree may come across several SLDNF*- 
trees through dotted edges. Any such a path starting at the root node No : Go is 
called a generalized SLDNF-derivation. 

Thus, there may occur two types of edges in a generalized SLDNF-derivation, 
" — ►" and "•■■>". For convenience, we use " =>" to refer to either of them. 
Moreover, for any node N^ : Gi we use L\ to refer to the selected (i.e. left-most) 
subgoal in Gj. 

Example 2.1. Let Pi be a general logic program and G a top goal, given by 

P 1 : p(X) - -np(/(X)). C P1 
G : <-p(a). 

The generalized SLDNF-tree GT^ p r a \ for Pi U {Go} is shown in Figure 1, where 
oo represents an infinite extension. Note that to expand the node N\, we build a 



lr This case occurs when either Tff. -.<—a or some of its descendant SLDNF*-trees is infinite, or 
TN i+1 -.<—A has an infinite number of descendant SLDNF*-trees. Note that LAST is used here 
only for the purpose of formulating an SLDNF*-trec — showing that Ni is the last node of the 
SLDNF*-tree. In practical implementation of SLDNF*-trees, in such a case Ni will never be 
marked by LAST since it requires an infinitely long time to build Tjv i+1 :<— .A together with all of 
its descendant SLDNF*-trees. However, the SLDNF*-tree is always completed at Ni, whether TV; 
is marked by LAST or not, because (1) such a case occurs at most one time in an SLDNF*-tree 
and (2) it always occurs at the last generated node JVj. 
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subsidiary SLDNF*-tree T/v 2 :<-p(/(a))- Since T/v 2:< _ p (/( a )) neither contains a success 
leaf nor finitely fails (i.e. not all of its leaf nodes are marked as □ /), N\ is the last 
node of 7jv 0:< _ p ( o ), marked as LAST. We see that GT„_ p ( ) is infinite, although all 
of its SLDNF*-trees arc finite. 

iVo: <-p(o) 



Nv «--p(/(a)) . .. ^2: «-p(/(a)) 

LAST I C P1 

Nr- - -p(/(/(o))) ..... iV 4 : ^ p(/(/(o))) 
LAST I C P1 



CXO 



Fig. 1. The generalized SLDNF-tree GT^ p(a) . 



Example 2.2. Consider the following general logic program and top goal. 

Pi ■■ P <- C P1 
q- c qi 

q^q. C q2 
G : <-p. 

The generalized SLDNF-tree GT^ P for P 2 U{G } is depicted in Figure 2 (a). GTU P 
consists of two SLDNF*-trees, Tn -.^ p and Tjy 2: ^. q , which are constructed as follows. 
Initially, Tat o:< _ p has only the root node No :<— p. Expanding the root node against 
the clause C Pl leads to the child node N\ :<— -*q. We then build a subsidiary 
SLDNF*-tree 7V 2:< _ g for P 2 U {<— q} via the depth-first, left-most control strategy, 
where the expansion stops right after the node A/3 is marked as LAST. Since T/v 2:< - 9 
has a success leaf, N± gets a failure child node N 5 . Tjv o:< _ p is then completed. 

For the purpose of comparison, the standard SLDNF-trees for P 2 U {<— p} are 
shown in Figure 2 (b). Note that Figure 2 (a) is finite, whereas Figure 2 (b) is not. 

N : ^p N : ^ p N 2 : <- q 

TVi: <- JVi: <- iV 3 : D t /V 4 : 



iVs: n t N 4 : 
LAST 



(a) (b) 



Fig. 2. The generalized SLDNF-tree GT^ P (a) and its two corresponding standard SLDNF-trees 
(b). 
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3. CHARACTERIZING AN INFINITE GENERALIZED SLDNF-DERIVATION 

In this section we establish a necessary and sufficient condition for an infinite gen- 
eralized SLDNF-derivation. We begin by introducing a few concepts. 

Definition 3.1. Let T be a term or an atom and S be a string that consists 
of all predicate symbols, function symbols, constants and variables in T, which is 
obtained by reading these symbols sequentially from left to right. The symbol string 
of T, denoted St, is the string S with every variable replaced by X. 

For instance, let 7\ = a, T 2 = f(X, g(X, f(a, Y))) and T 3 = [X, a}. Then S Tl = a, 
St 2 = JXgXfa-X and St 3 = [<*1MD]]- Note that [X,a] is a simplified representa- 
tion for the list [X|[a|[]]]. 

Definition 3.2. Let St ± and St 2 be two symbol strings. St x is a projection of 
S"t 2 , denoted S^i Qproj St 2 , if Sri is obtained from 5t 2 by removing zero or more 
elements. 

For example, aXXbc Qproj faXeXbXcd. It is easy to see that the relation C pro j 
is reflexive and transitive. That is, for any symbol strings Si, S2 and S3, we have 
Si (=proj S\, and that Si ^proj S2 and S2 Qproj S3 implies Si ^proj S3. 

Definition 3.3. Let Ai = p(.) and A2 = p(.) be two atoms. Ai is said to loop 
into A 2 , denoted Ai ^i oop A 2 , if Sa 1 ^= pr oj Sa 2 - Let TV^ : Gi and Nj : Gj be two 
nodes in a generalized SLDNF-derivation with L\ < a nc Lj and L\ ^i oop L 1 -. Then 
Gj is called a loop goal of G- L . 

The following result is immediate. 

Theorem 3.4. (1) The relation ^i oop is reflexive and transitive. 
{2) If Ai ^ loop A 2 then \Ai\ < \A 2 \. 

(3) If G3 is a loop goal of G2 that is a loop goal of G\ then G3 is a loop goal of 
Gi. 

Observe that since a logic program has only a finite number of clauses, an infinite 
generalized SLDNF-derivation results from repeatedly applying the same set of 
clauses, which leads to infinite repetition of selected variant subgoals or infinite 
repetition of selected subgoals with recursive increase in term size. By recursive 
increase of term size of a subgoal A from a subgoal B we mean that A is B with 
a few function/constant /variable symbols added and possibly with some variables 
changed to different variables. Such crucial dynamic characteristics of an infinite 
generalized SLDNF-derivation are captured by loop goals, as shown by the following 
principal theorem. 

Theorem 3.5. D is an infinite generalized SLDNF-derivation if and only if it 
is of the form 

N :G ^ ...N gi : G gi ...N 92 : C,,. =>• ...JV fls : G g3 ... 
such that for any j > 1, G gj+1 is a loop goal of G gj . 
We need Higman's Lemma to prove this theorem. 2 



2 It is one of the anonymous reviewers who brought this lemma to our attention. 
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Lemma 3.6 (Higman's Lemma [Higman 1952; Bol 1991]). Let{A i ]'?l 1 bean 
infinite sequence of strings over a finite alphabet S. Then for some i and k > i, 

A% ^—proj Afc 

The following result follows from Lemma 3.6. 

Lemma 3.7. Let {Ai} ( ^ 1 be an infinite sequence of strings over a finite alphabet 
E. Then there is an infinite increasing integer sequence {ni}°^L 1 such that for all i 
AC A 

Proof. 3 Suppose this is not true. Let us take a finite maximal subsequence 
Ac AC - C A 

^ni )=proj -^712 —pro j •■• ^=proj -™-n kl 

The subsequence is maximal in the sense that for no i > do we have A nki ^p ro j 
Ai. We know that such a subsequence with length at least 2 must exist from 
Lemma 3.6 and the assumption that the assertion of the lemma does not hold 
for the sequence {Ai}°l 1 . Now look at the elements of the original sequence with 
indices larger than and take another such finite maximal subsequence from 
them. Continuing in this way, we get infinitely many such maximal subsequences. 
Let {A 7lk be the sequence of last elements of the maximal subsequences. By 
Lemma 3.6, this sequence has two elements, A nk and A nk with n\ ti < n^, such 
that A nk , C_ pro j A nk . . This contradicts the assumption that A nk . is the last element 
of some finite maximal subsequence. □ 

The following lemma is needed to prove Theorem 3.5. 

Lemma 3.8. Let D be an infinite generalized SLDNF- derivation. Then there are 
infinitely many goals G gi , G 92 , ... in D such that for any j > 1, Lj < anc Lj +1 . 

Proof. Let D be of the form 

N a : G => JVi : d => ... => N t : G t => N l+1 : G l+1 ... 

c 

Consider derivation steps like Ni : d — ► N i+ i : Gi + i ■ ■ ■ >N i+2 : Gi + 2, where Lj 
is a positive subgoal and L] +1 = -<A a negative subgoal. So L\ +2 = A. We see that 
both L\ and L\ +1 need the proof of L} +2 . Moreover, given L\ +2 the provability 
of L\ does not depend on L\ +1 . Since L\ +1 has no descendant subgoals, removing 
it would affect neither the provability nor the ancestor-descendant relationships of 
other subgoals in D. Therefore, we delete Lj +1 by marking N i+ i with ff. 

For each derivation step Ni : Gi —> Ni + i : Gj+i, where L\ is a positive subgoal 
and C = A <— B\, ...B n such that AO = L\6 under an mgu 6, we do the following: 

(1) If n = 0, which means L\ is proved at this step, mark node Ni with 

(2) Otherwise, the proof of L\ needs the proof of Bj8 (j = l,...,n). If all descendant 
nodes of Ni in D have been marked with which means that all Bj9 have 
been proved at some steps in D, mark node N with ff. 

Note that the root node N will never be marked with for otherwise G would 
have been proved and D should have ended at a success or failure leaf. After the 
above marking process, let D become 



3 This proof is suggested by an anonymous reviewer. 
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N Q : Go => ••• => iV 4l : G 4l ... JV i2 : G l2 => ... => N ik : G lk => ... 

where all nodes except No, N il ,N 2 , N ik , ... are marked with #. Since we use the 
depth- first, left-most control strategy, for any j > the proof of L\ needs the proof 
of Lj. +1 (let io = 0), for otherwise N j would have been marked with That is, 
L,- . is an ancestor subgoal of Lj. , , . Moreover, D must contain an infinite number 
of such nodes because if N ik : G ik was the last one, which means that all nodes 
after N ik were marked with then L\ would be proved, so that N ik should be 
marked with a contradiction. □ 

We are ready to prove Theorem 3.5. 

Proof (Proof of Theorem 3.5). (<=) Straightforward. 

By Lemma 3.8, D contains an infinite sequence of selected subgoals H\ = 
{Lj. such that for any i Lj. -< a nc Lj. 1 ■ Since any logic program has only a finite 
number of predicate symbols, Hi must have an infinite subsequence H 2 = {L\.}°^ 1 
such that all L\. have the same predicate symbol, say p. We now show that H 2 has 
an infinite subsequence {L g .}°^ 1 such that for any i L g . ^i oop L gi i . 

Let T be the (finite) set of all constant and function symbols in the logic program 
and let E = TU { X}. Then the symbol string S L i of each L\. in H 2 is a string over 

E that begins with p. These symbol strings constitute an infinite sequence {pAi}°^L l 
with each Ai being a substring. By Lemma 3.7 there is an infinite increasing integer 
sequence {rii}^ such that for any i pA ni C pro j pA ni+1 . Therefore, H 2 has an 
infinite subsequence H 3 = {L g .} < *L 1 with S L i = pA ni being the symbol string of 
L g .. That is, for any i S L i C pro:j S L i . Thus, for any % L g . ^i oop L g . +i . □ 

4. CHARACTERIZING TERMINATION OF GENERAL LOGIC PROGRAMS 

In [Schreye and Decorte 1993], a generic definition of termination of logic programs 
is presented as follows. 

Definition 4.1 ([Schreye and Decorte 1993]). Let P be a general logic program, 
Sq a set of queries and Sr a set of selection rules. P is terminating w.r.t. Sq and 
Sr if for each query Qi in Sq and for each selection rule Rj in Sr, all SLDNF-trees 
for P U Qi} via Rj are finite. 

Observe that the above definition considers finite SLDNF-trees for termination. 
That is, P is terminating w.r.t. Qi only if all (complete) SLDNF-trees for P U 
Qi} are finite. This does not seem to apply to Prolog where there exist cases in 
which P is terminating w.r.t. Qi and Rj, although some (complete) SLDNF-trees 
for P U Qi} are infinite. Example 2.2 gives such an illustration, where Prolog 
terminates with a negative answer to the top goal Go- 

In view of the above observation, we present the following slightly different defi- 
nition of termination based on a generalized SLDNF-tree. 

Definition 4.2. Let P be a general logic program, Sq a finite set of queries and 
R the depth-first, left-most control strategy. P is terminating w.r.t. Sq and R if 
for each query Qi in Sq, the generalized SLDNF-tree for ?U{^- Qi} via R is finite. 

The above definition implies that P is terminating w.r.t. Sq and R if and only 
if there is no infinite generalized SLDNF-dcrivation in any generalized SLDNF-tree 

ACM Transactions on Computational Logic, Vol. TBD, No. TBD, TBD 20TBD. 



Characterizing Termination of General Logic Programs • 11 

GT^q ; . This obviously applies to Prolog. We then have the following immedi- 
ate result from Theorem 3.5, which characterizes termination of a general logic 
program. 

Theorem 4.3. P is terminating w.r.t. Sq and R if and only if for each query 
Qi in Sq there is no generalized SLDNF- derivation in GT^q i of the form 

N : Go => ...N gi : G gi ...N g2 : G 92 ...N g3 : G 93 ... 

such that for any j > 1, G 9j+1 is a loop goal of G gj . 

5. RELATED WORK 

Concerning termination analysis, we refer the reader to the papers of Decorte, De 
Schreye and Vandecasteele [Schreye and Decorte 1993; Decorte et al. 1999] for 
a comprehensive bibliography. Most existing termination analysis techniques are 
static approaches, which only make use of the syntactic structure of the source code 
of a logic program to establish some well-founded conditions/constraints that, when 
satisfied, yield a termination proof. Since non-termination is caused by an infinite 
generalized SLDNF-derivation, which contains some essential dynamic characteris- 
tics that are hard to capture in a static way, static approaches appear to be less 
precise than a dynamic one. For example, it is difficult to apply a static approach 
to prove the termination of program P 2 in Example 2.2 with respect to a query 
pattern p. 

The concept of generalized SLDNF-trees is the basis of our approach. There are 
several new definitions of SLDNF-trees presented in the literature, such as that 
of Apt and Doets [Apt and Docts 1994], Kunen [Kunen 1989], or Martelli and 
Tricomi [Martelli and Tricomi 1992]. Generalized SLDNF-trees have two distinct 
features as compared to these definitions. First, the ancester-descendent relation 
is explicitly expressed (using ancestor lists) in a generalized SLDNF-tree, which is 
essential in identifying loop goals. Second, a ground negative subgoal ->A at a node 
Ni in a SLDNF*-tree TN r -.o r is formulated in the same way as in Prolog, i.e. (1) the 
subsidiary SLDNF*-tree Tjy i+i: ^A for the subgoal terminates at the first success 
leaf, and (2) ->A succeeds if all branches of Tjv i+i:< -A end with a failure leaf and 
fails if Tjv i+i: <-y4 has a success leaf. When Tjv i+i: <-A goes into an infinite extension, 
the node Ni is treated as the last node of Tjv r :G r which can be finitely generated. 
As a result, a generalized SLDNF-tree exists for any general logic programs. 

Our work is also related to loop checking — another research topic in logic pro- 
gramming which focuses on detecting and eliminating infinite loops. Informally, a 
derivation 

A, : G .V, : C : > ... =► N : G; ... => N k : G k => ... 

is said to step into a loop at a node N k : Gk if there is a node N : Gi (0 < i < k) 
in the derivation such that Gi and Gk are sufficiently similar. Many mechanisms 
related to loop checking have been presented in the literature (e.g. [Bol et al. 
1991; Shen et al. 2001]). However, most of them apply only to SLD-derivations 
for positive logic programs and thus cannot deal with infinite recursions through 
negation like that in Figure 1 or 2. 

Loop goals are defined on a generalized SLDNF-derivation for general logic pro- 
grams and can be used to define the sufficiently similar goals in loop checking. For 
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such an application, they play a role similar to expanded variants defined in [Shen 
et al. 2001]. Informally, expanded variants are variants except that some terms 
may grow bigger. However, expanded variants have at least three disadvantages 
as compared to loop goals: their definition is less intuitive, their computation is 
more expensive, and they are not transitive in the sense that A being an expanded 
variant of B that is an expanded variant of C does not necessarily imply A is an 
expanded variant of C . 

6. CONCLUSIONS AND FUTURE WORK 

We have presented an approach to characterizing termination of general logic pro- 
grams by making use of dynamic features. A concept of generalized SLDNF-trees 
is introduced, a necessary and sufficient condition for infinite generalized SLDNF- 
dcrivations is established, and a new characterization of termination of a general 
logic program is developed. 

We have recently developed an algorithm for automatically predicting termina- 
tion of general logic programs based on the characterization established in this 
paper. The algorithm identifies the most-likely non-terminating programs. Let P 
be a general logic program, Sq a set of queries and R the depth-first, left-most 
control strategy. P is said to be most-likely non-terminating w.r.t. Sq and R if for 
some query Qi in Sq, there is a generalized SLDNF-derivation with a few (e.g. two 
or three) loop goals. Our experiments show that for most representative general 
logic programs we have collected in the literature, they are not terminating w.r.t. 
Sq and R if and only if they are most-likely non-terminating w.r.t. Sq and R. 
This algorithm can be incorporated into Prolog as a debugging tool, which would 
provide the users with valuable debugging information for them to understand the 
causes of non-termination. 

Tabled logic programming is receiving increasing attention in the community of 
logic programming (e.g. [Chen and Warren 1996; Shen et al. 2002]). Verbaeten, Dc 
Schreye and Sagonas [Verbaeten et al. 2001] recently exploited termination proofs 
for positive logic programs with tabling. For future research, we are considering 
extending the work of the current paper to deal with general logic programs with 
tabling. 
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